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DGScription 

MULTIPLE INTELLIGENT AGENTS HAVING VARIED DEGREES OF DOMAIN 
KNOWLEDGE 



5 Cross-reference to Related Applications 

This application is related to the following U.S. 
Patent Applications/ all of which were filed on even date 
herewith by Bigus et al : U-S. Serial No. 08/821,935 entitled 
"INTELLIGENT AGENT WITH NEGOTIATION CAPABILITY AND METHOD OF 

10 NEGOTIATION THEREWITH", U.S. Serial No. 08/822,119 entitled 
"APPARATUS AND METHOD FOR COMMUNICATING BETWEEN AN INTELLIGENT 
AGENT AND CLIENT COMPUTER PROCESS USING DISGUISED MESSAGES" and 
U.S. Serial No. 08/822,993 entitled "APPARATUS AND METHOD FOR 
OPTIMIZING THE PERFORMANCE OF COMPUTER TASKS USING INTELLIGENT 

15 AGENT WITH MULTIPLE PROGRAM MODULES HAVING VARIED DEGREES OF 

DOMAIN KNOWLEDGE". The disclosures of all of these applications 
are hereby incorporated by reference herein. 

Field of the Invention 

The invention is generally related to intelligent 
20 agent computer programs executable on computer systems and the 
like, and in particular, the use of such agents for performing 
specific tasks in a computer system. 

Background of the Invention 

Since the advent of the first electronic computers 
25 in the 1940 's, computers have continued to handle a greater 
variety of increasingly complex tasks. Advances in 

semiconductors and other hardware components have evolved to 
the point that current low- end desktop computers can now handle 
tasks that once required roomfuls of computers. 
30 Computer programs, which are essentially the sets' of 

instructions that control the operation of a computer to 
perform tasks, have also grown increasingly complex and 
powerful. While early computer programs were limited to 
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performing only basic mathematical calculations, current 
computer programs handle complex tasks such as voice and image 
recognition, predictive analysis and forecasting, multimedia 
presentation, and other tasks that are too numerous to mention • 

However, one common characteristic of many computer 
programs is that the programs are typically limited to 
performing tasks in response to specific commands issued by an 
operator or user. A user therefore must often know the 
specific controls, commands, etc. required to perform specific 
tasks. As computer programs become more complex and feature 
rich, users are called upon to learn and understand more and 
more about the programs to take advantage of the improved 
functionality. 

m addition to being more powerful, computers have 
also become more interconnected through private networks such 
as local area networks and wide area networks, and through 
public networks such as the Internet. This enables computers 
and their users to interact and share information with one 
another on a global scale. However, the amount of information 
is increasing at an exponential rate, which makes it 
increasingly difficult for users to find specific information. 

As a result of the dramatic increases in the both 
complexity of computer programs and the amount of information 
available to users, substantial interest has developed in the 
area of intelligent agent computer programs, also referred to 
as intelligent agents or simply agents, that operate much like 
software- implemented "assistants" to automate and simplify 
certain tasks in a way that hides their complexity from the 
user. With agents, a user may be able to perform tasks without 
having to know specific sequences of commands. Similarly, a 
user may be able to obtain information without having to know 
exactly how or where to search for the information* 

Intelligent agents are characterized by the concept 
of delegation, where a user, or client, entrusts the agents to 
handle tasks with at least a certain degree of autonomy. 
Intelligent agents operate with varying degrees of constraints 



wo 98/43190 PCT/US98/04942 

depending upon the amount of autonomy that is delegated to them 
by the user. 

Intelligent agents may also have differing 
capabilities in tenns of intelligence, mobility, agency, and 
5 user interface. Intelligence is generally the amount of 
reasoning and decision making that an agent possesses. This 
intelligence can be as simple as following a predefined set of 
rules, or as complex as learning and adapting based upon a 
user's objectives and the agent's available resources. An 

10 intelligent agent's intelligence or skill as applied to a 
specific field or function is referred to as domain knowledge. 

Mobility is the ability to be passed through a 
network and execute on different computer systems. That is, 
some agents may be designed to stay on one computer system and 

15 may never be passed to different machines, while other agents 
may be mobile in the sense that they are designed to be passed 
from computer to computer while performing tasks at different 
stops along the way. User interface defines how an agent 
interacts with a user, if at all. 

20 Agents have a number of uses in a wide variety of 

applications, including systems and network management, mobile 
access and management, information access and management, 
collaboration, messaging, workflow and administrative 
management, and adaptive user interfaces. Another important 

25 use for agents is in electronic commerce, where an agent may 
be configured to seek out other parties such as other users, 
computer systems and agents, conduct negotiations on behalf of 
their client, and enter into commercial transactions. 

The relative performance of intelligent agents in 

30 handling certain tasks may vary between agents, e.g. , depending 
upon the degree of domain knowledge possessed thereby. 
However, an agent which performs well when handling a 
particular task in one instance may not perform as well when 
handling other tasks, or even when handling the same type of 

35 task in other situations and under other circumstances. As a 
typical agent may experience a wide variety of situations when 
it is operating in a real world environment, a given. agent will 
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not always be able to provide its client with optimal 
performance. Moreover, it is often difficult, if not 
impossible, to develop an agent that is capable of handling all 
possible situations with equally proficient results. 

For example, in electronic commerce applications, 
security is a major concern. Intelligent agents may be 
entrusted with the authority to sell or purchase products on 
behalf of a client, and may be dispatched to any number of 
computer systems in a network to perform negotiations. Often, 
the computer systems to which the agents are dispatched are 
remote to the client's system, and thus not under the direct 
control of the client. An agent may thus be subject to 
scrutiny by third parties, either through monitoring 
transmissions between the agent and its client, or through 
actual scanning and reverse compiling of the program code of 
the agent. In some instances, the risk may be segregated into 
transmission risk, which is associated with the transmission 
media between the agent and the client, and environment risk, 
which is associated with the environment in which the agent is 
resident. Each type of risk may present unique concerns for 
a given agent. 

Beyond the clear risk of discovering confidential 
information such as a client's credit card or bank account 
number, additional risks are posed when an agent is subject to 
scrutiny. As an example, an agent would be placed at a 
competitive disadvantage if any of its negotiation strategies, 
techniques or plans, or any specific instructions from its 
client, were discovered by other parties. If a message to an 
agent from its client indicates that the agent should offer 
$100, but is authorized to go as low as $90, another party that 
intercepts this message immediately knows that a transaction 
may be completed at the lower price. Also, even if the message 
is not intercepted, if the agent has stored the $9 0 price as 
its lowest authorized offer, reverse compilation or scanning 
of the agent by another party may lead to similar results. 

The degree of domain knowledge such as autonomy 
granted to an agent by its client, and the remaining domain 
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knowledge retained by the client, may vary significantly for 
different agents. In general, however, the more domain 
knowledge that is imparted to an agent, the greater the 
security risk presented to the agent and its client. 

Often, a tradeoff must be made between absolutely 
secure agent transactions with prohibitively high costs, and 
less secure agent transactions with acceptable costs. The 
greatest security would be obtained when most, if not. all, of 
the domain knowledge for conducting negotiations is retained 
by the client, with only a limited intermediary agent sent out 
to relay messages between the client and the other party being 
negotiated with. However, this configuration would present 
significant time and cost burdens on the client since all 
decision making would remain with the client, and since 
multiple messages would likely need to be transmitted between 
the client and the other party. Given these likely burdens, 
there would be little, if any, benefit in utilizing an agent 
at all. This would especially be the case in operating 
environments where the security risks are known to be small. 

Security risks are also present for other 
applications of intelligent agents. For example, a client may 
have developed an agent with a unique search engine that 
greatly outperforms its competitors* designs. In certain 
operating environments, the agent may be more likely to be 
intercepted and copied by competitors -- a situation that the 
client- would prefer to avoid. However, without an agent 
mechanism that appropriately balances cost and security, the 
industry will continue to be limited by today's solutions • 

Summary of the Invention 

The invention addresses these and other problems 
associated with the prior art in providing a manner of 
optimizing the performance of a given computer task by 
utilizing a plurality of intelligent agents suited to perform 
the computer task but having varied degrees of domain 
knowledge. Based upon an objective criteria that is determined 
for a given situation, one of the plurality of intelligent 
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agents is selected and dispatched to perform the task. 
Consequently, the performance of the computer task is optimized 
for a wide variety of situations. 

In an exemplary embodiment of the invention, 
5 intelligent agents having varied levels of autonomy are used 
to conduct negotiations, with the projected security risk for 
a given negotiation being used as the objective criteria by 
which agents are selected to conduct the negotiations. In 
unknown or untested environments, or in environments in which 

10 known security risks are present, a limited intermediary agent 
may be dispatched, leaving the majority of domain knowledge 
such as authority to complete transactions with the client. 
For other environments where the security risk is known to be 
not as great, agents having more autonomy to conduct 

15 negotiations and complete transactions independent of the 
client may be dispatched, thereby relieving the client of much 
of the burden of handling these particular negotiations. 
However, it should be appreciated that a vast number of 
objective criteria for selecting different agents may be used 

20 in the alternative, and thus the invention is not limited to 
this particular embodiment or application. 

These and other advantages and features, which 
characterize the invention, are set forth in the claims annexed 
hereto and forming a further part hereof. However, for a 

25 better understanding of the invention, and of the advantages 
and objectives attained through its use, reference should be 
made to the Drawing, and to the accompanying descriptive 
matter, in which there is described illustrated embodiments of 
the invention. 

30 Brief Description of the Drawing 

FIGURE 1 is a flowchart illustrating the operation 
of an agent manager in selecting one of a plurality of 
intelligent agents with varied degrees of domain knowledge to 
perform a computer task, consistent with the invention. 
3 5 FIGURE 2 is a flowchart illustrating the operation 

of an agent manager in selecting within an intelligent agent 
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one or more of a plurality of program modules with varied 
degrees of domain knowledge to perform a computer task, 
consistent with the invention. 

FIGURE 3 is a flowchart illustrating the operation 
5 of an intelligent agent in selecting one or more of a plurality 
of program modules therein having varied degrees of domain 
knowledge to perform a computer task, consistent with the 
invention. 

FIGURE 4 is a block diagram of a networked computer 
10 system environment for use with the illustrated embodiments of 
the invention. 

FIGURE 5 is a block diagram of the networked computer 
system of Fig. 4, illustrating the interaction between 
intelligent agents therein. 
15 FIGURE 6 is another block diagram of the networked 

computer system of Fig, 4, illustrating in greater detail the 
primary components in the client and remote computer systems. 

FIGURE 7 is a block diagram of an exemplary agent 
manager and reinforcement learning module consistent with the 
20 invention, and for use in the selection of one of a plurality 
of agents having varied degrees of domain knowledge. 

FIGURE 8 is a flowchart illustrating the operation 
of a semi - autonomous agent consistent with invention. 

FIGURE 9 is a flowchart illustrating the operation 
25 of a fully -dependent agent consistent with invention. 

FIGURE 10 is a flowchart illustrating the operation 
of a fully -autonomous agent consistent with invention. 

FIGURE 11 is a block diagram of an exemplary 
intelligent agent having multiple program modules of varied 
30 degrees of domain knowledge. 

FIGURE 12 is a block diagram of another exemplary 
intelligent agent constructed from program modules from one or 
more pools of program modules. 
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Detailed Description f the Illustrated Embodiments 

The embodiments of the invention are principally 
directed to optimizing the performance of computer tasks by 
intelligent agents in a wide variety of situations. 
5 Intelligent agents are computer programs which have 

been delegated a degree of autonomy but which are limited to 
operating within constraints defined by their client (which may 
be, for example, another agent; a computer program, application 
or system; or an individual interacting with the agent through 

10 a computer hereinafter a client computer process) . A subset 
of such agents which are capable of being passed between and 
operating in different applications or computer systems are 
referred to as mobile agents. In certain circumstances, the 
functions of a mobile agent, combined with the functions of an 

15 agent manager program that is resident in a client's computer 
system and interacts with the mobile agent, may cooperatively 
be considered as portions of a single intelligent agent where 
the domain knowledge has been allocated between home and mobile 
portions. However, as used herein, the term agent may be 

20 considered to include simply a portion or module of an agent 
resident in a computer system irrespective of whether any 
portion of the functionality for the agent as a whole is 
resident on another computer system or in another computer 
process. 

25 In general, an agent has the ability to sense, 

recognize and act. These functions are typically embodied in 
a number of components, including an engine, a knowledge 
component, an adapters component, a library, and a view 
component. Generally, the engine controls the overall 

30 operation of an agent and functions as the "brains" of the 
agent, and the knowledge component stores information that is 
representative of the acquired knowledge of the agent. The 
adapters component handles interaction between an agent and 
external objects, thereby functioning as the mechanism through 

35 which the agent "senses" and interacts with its environment. 

A library persistently stores the information utilized by the 
knowledge component, and the view component provides the human 
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interface, if any, for an agent, e.g., for supplying 
instructions to the agent. 

Moreover, as discussed above, agents may have varied 
degrees of domain knowledge that is, intelligence and/or 
5 skills applied to a specific field or function, typically 
applied to a particular computer task. This may be exhibited 
through greater or lesser degrees of autonomy or authority, 
alternate processing strategies, different types and/or amounts 
of stored information, different sources of information, 

10 different capabilities (e.g., monitoring functionality vs. 
monitoring and response functionality) , different goals or 
objective functions, different inputs or sensed values to which 
an agent may be responsive, alternate logic implementations 
(e.g., neural networks vs. procedural logic), etc. 

15 For example, in the illustrated embodiments discussed 

in greater detail below, domain knowledge is represented in 
part by the degree of autonomy granted to an agent to conduct 
negotiations in electronic commerce applications as well as the 
inputs to which the agent is responsive and the knowledge 

20 present in the agent. In addition, in other electronic 
commerce applications, domain knowledge may represent different 
manners or strategies for achieving a desired goal, e.g., 
optimizing a negotiation strategy for one agent to maximize 
profit, while optimizing a strategy for another agent to 

25 maximize the number of completed transactions. Domain 
knowledge in such applications may also be market dependent, 
e.g., where agents are optimized for automobile markets, stock 
markets, etc. 

AS another example, domain knowledge may represent 
30 a particular indexing or searching algorithm used by a data 
mining agent, e.g., where one particular searching algorithm 
may work better for legal cases. while another may work better 
for technical documents. In other data mining applications, 
domain knowledge for one type of agent may include only . the 
35 ability to monitor for problems, e.g., looking for low stock 
or late deliveries, in an inventory monitoring agent. In. other 
agents, the domain knowledge may also include the ability to 
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respond, e.g. , in the above inventory application, ordering low 
stock items, ordering alternates for late deliveries, or 
canceling orders when sales are low. 

Domain knowledge may also simply represent different 
manners of performing the same or similar general task, e.g., 
for the purpose of disguising the identity of the agents' 
client, since differently- configured agents may present 
different characteristics when scanned by third parties. For 
example, this may permit differently-configured agents to be 
dispatched from a single client to conduct negotiations with 
a single party so that the agents play off one another to 
obtain more favorable results for the client. 

Intelligent agents are typically programmed to handle 
particular computer tasks, which may include practically any 
operation performed on a computer. For example, in electronic 
commerce applications, the computer tasks for which agents are 
programmed to handle are typically that of conducting 
negotiations and completing transactions. For data mining 
applications, the computer tasks may be the indexing or 
retrieval of desired information from one or more databases. 

Consistent with the invention, one or more objective 
criteria are used in the selection of one of a plurality of 
agents, or alternatively, one of a plurality of program modules 
in an agent, such that a given computer task is handled with 
optimal performance for its given situation. The plurality of 
agents (or program modules) are all configured to handle a 
common general computer task (e.g., "buy a car", "get me 
everything you can find out about company X", etc.), although 
the relative performance, capabilities, manners, and mechanisms 
by which the different agents (or program modules) operate to 
handle the task may nonetheless differ. 

A program module (or mechanism) may be considered to 
include any function, object, or block of program code 
configured to perform a given computer task. A suitable 
objective criteria may represent practically any testable or 
measurable variable or variables that varies in different 
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situations to which an agent may be exposed, and that may 
affect an agent's relative performance under those situations. 

For example, in the illustrated embodiments discussed 
in greater detail below, the objective criteria is the 
perceived risk to an agent dispatched to a remote computer 
system, whether due to known risks or due to a lack of 
knowledge as to the risk inherent in an untested environment. 
The obj ective criteria may also include relative compute or 
transmission speeds, or the perceived risk. of a particular 
transmission media, which may be used to determine whether 
domain knowledge related to disguising or encrypting messages 
is implemented in a dispatched agent. 

Other examples of objective criteria may include the 
source of information for data mining applications (e.g., 
particular domains or databases, or types of data), user 
approval or feedback (e.g., a qualitative assessment of 
performance by a user) , and profits or costs for electronic 
commerce applications , among others. 

Selection of agents or program modules within agents 
based upon the objective criteria will vary depending upon the 
particular objective criteria and types of situations to which 
the. agent (s) may be exposed. For example, for the illustrated 
embodiments discussed in greater detail below, the risk to^an 
agent may be tested based upon the IP address of the remote 
system to which an agent is to be dispatched, and any 
identifying information about the other party to a negotiation, 
among others. Separate risk factors may also be monitored for 
the transmission media and the agent's resident environment 
such that domain knowledge specific to each type of risk may 
be separately utilized as necessary. 

For data mining applications, . the source of 
information could be tested based upon the IP address of the 
database being searched, the type of information being sought, 
etc. In electronic commerce and other applications, the 
reliability of data could also be tested by looking at the 
source thereof (e.g., prices retrieved from a bulletin board 
may be more reliable than prices quoted in a magazine article) , 
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and/or by comparing the similarity of the data to the type of 
information being sought. Also, the volatility of a market or 
the type, quantity or value of particular classes of goods may 
be tested (e.g., a car buying agent may warrant or require a 
more aggressive negotiation strategy than a compact disc buying 
agent) . 

In addition, selection based upon an objective 
criteria may occur dynamically, and may even be updated from 
time to time. For example, an agent may be configured to 
dynamically select program modules in response to changing 
circumstances such as changes in market conditions, changes in 
size, speed, or other capacity of a remote site, and new input 
from a client (e.g., as related to information that is 
otherwise unavailable to an agent while resident in a remote 
system), among others. 

Agents or program modules may be matched with 
particular values of an objective criteria to assist in the 
selection of agents or program modules. The matching may be 
as simple as a mapping or table matching agents/program modules 
and values of the objective criteria, or may include predictive 
and interpretive logic such as provided by factor weighting, 
neural networks, expert systems, fuzzy logic systems, etc. to 
attempt to select the most appropriate agent/program module 
even when no direct match exists for a given value of the 
objective criteria. 

Moreover, selection based upon the obj ective criteria 
may be adaptively controlled. For example, the actual results 
and relative performance of agents dispatched to perform 
particular tasks may be analyzed and used to modify the 
selection of agents for given values of the objective criteria. 
To this extent, a reinforcement learning module, implemented 
for example in a neural network, may be particularly well 
suited for use in agent or program module selection consistent 
with the invention. 

As discussed above, the objective criteria may be 
used to select between a plurality of agents, or between a 
plurality of program modules in an agent, that have varied 



wo 98/43190 



PCT/US98/04942 



degrees of domain knowledge. The program flow for dispatching 
one of a plurality of agents based upon an objective criteria 
is illustrated by agent manager 200 in Fig. 1. In block 201, 
a request to perform a task is received, and in block 202 the 
5 objective criteria is tested by the agent manager. Next, in 
block 204, an agent is selected, based upon the objective 
criteria, from a pool of agents having varied degrees of domain 
knowledge. In block 206, the selected agent is dispatched to 
perform its task, and in block 208, the performance of the 

10 agent upon completion of the task is optionally analyzed if 
adaptive agent selection is utilized. 

Similarly, the program flow for selecting one or more 
of a plurality of program modules based upon an objective 
criteria is illustrated by agent manager 210 in Fig. 2. In 

15 block 211, a request to perform a task is received, and in 
block 212 the objective criteria is tested by the agent 
manager. : 

Next, in block 214, one or more program modules are 
selected, based upon the objective criteria, from a plurality 

20 of program modules having varied degrees of domain knowledge. 

In some embodiments, one of a plurality of available program 
modules may be selected. For example, program modules may have 
duplicate functionality that is implemented in alternate 
manners. In such instances, only one module may be selected. 

25 In other embodiments, individual program modules may 

have additive or non- overlapping functionality so that multiple 
program modules (e.g., a subset of program modules) are 
selected in a given agent. For example, one program module may 
provide basic domain knowledge, with additional program modules 

30 selected as necessary to add additional domain knowledge to 
more sophisticated agents. 

In addition, the agent may need to be configured for 
proper execution of the appropriate program module or modules. 
For example, if an agent is provided with both selected and 

35 non-selected program modules (e.g., where duplicative 
functionality is provided by different alternative modules) , 
an agent may be configured by initializing, sending a message 
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or setting a variable in the agent specifying which program 
module or modules to execute. In the alternative, an agent may 
be constructed from component parts prior to its dispatch, 
whereby the agent may be configured by bundling all of the 
selected program modules with any necessary components that are 
generic to all of an agents' permutations. 

Next, in block 216, the configured agent is 
dispatched to perform its task, and in block 218, the 
performance of the agent upon completion of the task is 
optionally analyzed if adaptive selection is utilized- 

It should also be appreciated that the functions of 
testing the objective criteria, selecting agents or agent 
program modules based upon the objective criteria, and/or 
analyzing the performance of agents need not be performed by 
an agent manager or other computer process resident in the 
client computer system, but may be performed by any suitable 
evaluation module or program. In addition, performance 
analyzation may also rely on a user's subjective assessment. 
A separate agent may be utilized as an evaluation module to 
perform any or all of these functions. Also, any of these 
functions may be implemented in the agent that has been 
dispatched to perform a task, whereby the agent includes an 
evaluation module. 

For example, as illustrated in Fig. 3, an agent 220 
may receive a task request in block 221, test the objective 
criteria in block 222, select the appropriate program module 
in block 224 and execute the selected program module in block 
226 to perform the task. In addition, the agent may analyze 
its performance in block 228 if desired. The above -described 
configuration has a number of benefits. For example, in 
electronic commerce applications, an agent could be dispatched 
with a task to seek out negotiations on multiple remote 
systems, whereby the agent would have the ability to 
dynamically determine the risk at each remote system and alter 
its behavior accordingly. 
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Exemplary Hardware/Software Environment 

A representative hardware environment suitable for 
use with the illustrated embodiments of the invention is 
illustrated in Fig. 4, where a networked computer system 10 
generally includes one or more computer systems, e.g., single- 
user computer systems 16, 18 and multi-user computer systems 
20, 60, coupled through a network 15. Multi-user computer 
system 20 typically includes one or more servers 25 to which 
one or more single -user computers 22 may networked through a 
separate network 24. Similarly, multi-user computer system 20 
typically includes one or more servers 65 coupled to one or 
more single -user computer systems 62 through a network 64, 
Network 15 may represent any type of networked interconnection, 
including but not limited to local -area, wide-area, wireless, 
and public networks (e.g., the Internet). 

It is anticipated that agents consistent with the 
invention may originate in and be resident from time to time 
on any of the above-mentioned computer systems. One possible 
distinction between the computer systems for the purposes of 
the, invention may be whether each is a client or a remote 
system relative to a particular agent. For example. Fig. 5 
illustrates an embodiment of computer system 10 where multi- 
user computer system 20 is a client system, and multi -user 
computer system 60 is a remote system. 

A client system will hereinafter refer to a computer 
system that provides an agent a certain level of security from 
manipulation by other parties when the agent is resident on the 
system. The client system is also the computer system from 
which management of the agent is typically handled. The agent 
typically but not necessarily will also originate from the 
client system. 

A remote system, on the other hand, will hereinafter 
refer to a computer system that is typically not capable of 
providing a desired level of security for an agent, generally 
because the computer system is not under the control of the 
client. It is typically while resident on a remote system that 
an agent runs the greatest risk of being scanned or reverse 
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compiled, or of having communications intercepted or monitored, 
by other parties. 

The various embodiments described hereinafter have 
principal uses in electronic commerce applications, where 
agents are configured to negotiate commercial transactions, 
generally in the role of buying or selling agents. The agents 
may negotiate with other agents, other computer systems, or 
even other individuals. The agents may interact one-.on-one, 
or may be capable of operating within a t>markett> of multiple 
agents, along the lines of a stock or commodity market. 
Computer systems having the ability to host agents for 
interaction therebetween include negotiating programs of 
varying sophistication and are hereinafter referred to as agent 
hosts. However, it should be appreciated that the invention 
applies equally to other applications of intelligent agents, 
and thus should not be limited specifically to electronic 
commerce applications . 

Fig. 5 illustrates a mobile intelligent agent 100 
which communicates with an agent manager 32 in client system 
20. During negotiation with another party such as negotiating 
agent 95, mobile agent 100 is resident on remote system 60. 
It should be appreciated that remote system 60 may be the 
client for agent 95, or may also be considered to be remote 
relative to this agent as well. 

An exemplary functional design of networked computer 
system 10 for implementing various embodiments of the invention 
is illustrated in Fig. 6. Server 25 of client system 20 
generally includes a central processing unit (CPU) 28 coupled 
to a memory 30 and storage 40 over a bus 54. A local area 
network interface is provided at 52, and an interface to remote 
system 60 over external network 15 is provided through 
interface 50. Agent manager program 32 is resident in memory 
30, as is a reinforcement learning module 34 that enables the 
agent manager to adaptively optimize the performance of tasks. 
Storage 40 includes one or more agents 4 2 (of which may include 
agent 100, for example) , which are computer programs or modules 
that may be retrieved and used locally within system 20, or 
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dispatched to remote systems to execute and perform tasks on 
behalf of the client system. Storage 40 also includes an agent 
mission database 44 which may track agent operations and the 
relative success or failure thereof. 
5 Server 65 of remote system 60 also includes a CPU 68 

coupled to a memory 70, storage 80, external network connection 
90 and local network connection 92 over a bus 94. An agent 
host program 72 is resident in memory 70 to handle interactions 
between agents resident in the remote system. Typically, the 

10 agent host program is an asynchronous message/event driven 
environment that provides a common platform over which agent 
computer programs execute and interact, much like an operating 
system. The agent host is also capable of permitting messages 
to be sent between agents and their clients. Memory 70 also 

15 includes a negotiating program 74 which operates as the toother 
partyj) in transactions with agent 100, which may be another 
agent, a market or bulletin board application, or even an 
interface program through which an individual interacts with 
agent 100. Storage 80 maintains a transaction history database 

20 82 which logs the transactions completed on the server. 

Servers 25, 6 5 may be, for example, AS/400 midrange 
computers from International Business Machines Corporation; 
However, it should be appreciated that the hardware embodiments 
described herein are merely exemplary, and that a multitude of 

25 other hardware platforms and configurations may be used in the 
alternative. 

Moreover, while the invention has and hereinafter 
will be described in the context of fully functioning computer 
systems, those skilled in the art will appreciate that the 

30 various embodiments of the invention are capable of being 
distributed as a program product in a variety of forms, and 
that the invention applies equally regardless of the particular 
type of signal bearing media used to actually carry out the 
distribution. Examples of signal bearing media include but are 

35 not limited to recordable type media such as floppy disks, hard 
disk drives, and CD-ROM's, and transmission type media such as 
digital and analog communications links. 
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Negotiating A'Qents with Adaptive Agent Selection 

Fig. 7 illustrates an exemplary configuration of 
client computer system 20 for providing adaptive agent 
selection consistent with the invention. Agent manager 32 is 
coupled to a reinforcement learning module 34 through which 
analysis of agent performance and control of agent selection 
is performed. Agent manager 32 is shown coupled to a pool of 
agents 4 2 having varied degrees of domain knowledge, or 
alternatively, to one or more pools of program modules 42' 
having varied degrees of domain knowledge. A dispatched agent 
100 communicates with agent manager 32 to conduct negotiations 
on behalf of the client. 

One or more information sources 56 are also coupled 
to agent manager 32 to enable it to obtain the factor or 
factors for testing the objective criteria (in this embodiment, 
security or risk) . Information sources 56 may include internal 
databases updated by manager 32 or other components in client 
system 20, as well as external databases. 

For example, risk may be determined by maintaining 
a database of one or more risk factors for particular 
destinations, addresses or parties being negotiated with. To 
this extent, information about a remote system or other agents 
may be obtained by the agent manager, such as a name or 
identification, an operating system, a type of communication 
port, a client, a bank and/or bank account number, a homebase 
address, the name or identification of an agent program, the 
size of an agent program, where messages and other 
communications with an agent originate, and/or the pattern of 
input/output (I/O) compared to CPU cycles for I/O 
transmissions. Also, the agent manager may attempt to retrieve 
a credit card number or bank account number from an unknown 
agent and validate the number. Moreover, an unknown agent may 
be scanned and compared to other known agents, e.g., comparing 
the percentage of identical code, determining the language the 
agent was written in, or searching for unique patterns in much 
the same manner as a virus checking program. 
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Risk may also be analyzed over time based upon the 
actual performance of dispatched agents, e.g., by monitoring 
the number of dispatched agents that disappear, or the length 
of time between communications from dispatched agents. Also, 
risk may be analyzed by monitoring dispatched agents for 
tampering, e.g., by performing checksums or cyclic redundancy 
checks, analyzing binary signatures, etc. Either the agent 
manager or the agents themselves may be used to detect any 
tampering. Risk may also be analyzed by monitoring the 
behavior of other parties being negotiated with. 

In addition, risk may be measured based upon the 
profitability of agents. For example, the performance of a 
negotiating agent may be measured in terms of the number of 
complete transactions, gross income of all transactions, net 
profit of all transactions, etc. The performance may then be 
offset by the costs of using the agent, e.g., in terms of the 
time required to complete transactions, local/remote CP.U 
charges, communication (on-line) charges, etc. If an agent is 
compromised in some way, where another party gains a 
negotiating advantage, a negative impact on profitability would 
be observed. 

Agent manager 32 provides inputs to reinforcement 
learning module 34 to permit module 34 to perform the functions 
of estimating risk over time and selecting agents or agent 
program modules. Module 34 maintains a mapping between 
agents /program modules and different levels or types of risk, 
so that an appropriate agent/program module is selected 
depending upon the measured level or type of risk to a 
dispatched agent under the current circumstances. 

Agent manager 32 provides optimum agent performance 
data related to a desired task for which to test the objective 
criteria. In response, module 34 outputs a control action 
output back to agent manager 32 to control which agent or agent 
program module should be selected and dispatched for performing 
the given task. Also, when a dispatched agent is returned and 
analyzed, agent manager 32 provides actual performance data to 
adaptively modify the selection functionality of module 34, 
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In the illustrated embodiment, an adaptive heuristic 
critic neural network architecture is utilized in module 34 to 
provide adaptive reinforcement learning. The design, training 
and operation of an adaptive heuristic critic neural network 
5 is well known in the art. For example, one suitable 
development environment for performing the development and 
training of such a neural network is the IBM Neural Network 
Utility available from International Business Machines 
Corporation, among others. 

10 Module 34 includes a sensor 36 which compares desired 

(optimum) agent performance and actual agent performance and 
generates a scalar reinforcement reward/penalty value 
therefrom. * A critic network 37 provides temporal difference 
learning in response to the actual agent performance data and 

15 the reinforcement value from the sensor. Critic network 37 
outputs a utility signal that is back propagated to a plurality 
of action networks 38. 

An action network 3 8 is provided for each agent or 
program module. Each action network receives agent performance 

20 data and outputs therefrom an action merit signal to a 
stochastic action behavior block 39. Block 39 

probabilistically chooses an action (here, which one of the 
agents or program modules to dispatch) , and outputs therefrom 
a control action signal representative thereof to agent manager 

25 32. Block 39 introduces randomness to agent selection to widen 
its search space, which has been found to improve long term 
performance of the module. 

Module 34 adaptively modifies its selection of agents 
based upon the analyzed performance of a dispatched agent. If 

30 an agent performs well, sensor 36 reinforces critic network 37, 
which in turn reinforces the corresponding action network 3 8 
such that it is more likely that the same agent/program module 
is selected under similar circumstances. On the other hand, 
if an agent performs poorly, sensor 3 6 negatively reinforces 

35 critic network 37, which in turn reduces its utility output to 
action networks 38 to reduce the likelihood that the same 
agent/program modules is selected under similar circumstances. 
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In the alternative, other reinforcement learning 
modules and algorithms may be used. For example, other neural 
network architectures, such as Q- Learning neural networks, 
Bayesian networks, dynamic programming techniques, among 
5 others, may be used* 

Module 34 may be initially trained in a manner known 
in the art, e.g., by using example training records. In the 
alternative, module 34 may simply be initialized with 
predetermined or random networks which act as a starting 

10 reference point from which the module adaptively learns. The 
former approach often improves the quality of the initial 
decisions made by the module in operation. 

Agent pool 42 includes a plurality of agents in which 
the domain knowledge thereof varies in the degree of autonomy 

15 delegated thereto during negotiations, the internal knowledge 
therein, and the inputs to which an agent is responsive 
thereto. At one extreme, a simple intermediary agent, or fully 
dependent agent, may have little domain knowledge in that the 
agent merely serves the function of passing messages between 

20 a client and another party to a negotiation, such that 
authority for determining offers, accepting or rejecting 
offers, implementing negotiation strategies, etc. rests fully 
with the client. At the other extreme, a fully autonomous 
agent may be provided with the basic parameters for a desired 

25 transaction and thereafter entrusted with the authority to 
complete such a transaction free from any additional input on 
the part of the client. Between these extremes, one or more 
additional agents, referred to as semi -autonomous agents, may 
be entrusted with the ability to conduct limited negotiations. 

30 One possible semi -autonomous agent may be authorized to conduct 
negotiations for a limited number of offer iterations, but with 
final authority for approving an agreed-upon transaction still 
resting with the client. Other possible degrees of domain 
knowledge between the extremes are also possible. 

35 Fig. 8 illustrates an. exemplary program flow for 

negotiation with a semi -autonomous agent which has authority 
to negotiate with another party with limits on the offer amount 
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and the number of iterations (offers made by the agent) that 
the agent can undertake on its own. The interaction between 
the agent and an agent manager on a client system, and the 
other party being negotiated with (another agent, computer 
5 process or system, or individual interacting through a 
computer) , is also illustrated. 

For the semi -autonomous agent, negotiations are 
instituted after the agent manager selects the agent from the 
agent pool and dispatches the agent, as illustrated in blocks 

10 130 and 131. It should be appreciated that the agent may be 
dispatched with a desired transaction or transactions which the 
client wishes to complete (including information such as 
desired quantity, description of goods/services to be 
sold/purchased, etc.) , as well as any other constraints placed 

15 upon the agent by the manager (e.g., the number of negotiation 
iterations to process, a maximum or minimum limit on the amount 
to offer, etc.). In the alternative, some or all of this 
information may be relayed to the agent at a later time by the 
agent manager. 

20 Next, in block 132, the agent gathers information 

after it has been dispatched to the remote system. For 
example, the agent may poll a bulletin board or market for 
outstanding offers, or may attempt to find and initiate 
negotiations with another party, e.g., another computer system, 

25 another agent, or an individual interacting through a computer 
system. The agent may also simply wait for incoming offers 
from other parties. Block 132 also performs the functions of 
processing messages from other parties and from the agent 
manager. 

30 Mext, in response to input received by the agent, a 

decision is made in block 134. It is in this block that a 
negotiation strategy of the agent is implemented, whereby the 
agent determines, based upon the input it receives, whether and 
how to proceed in negotiations with another party. 

35 Generally, one of three actions may be instituted in 

response to block 134. First, as illustrated by block 136, a 
message may be sent to the other party, e.g., indicating 
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acceptance or rejection of an outstanding offer, issuing an 
offer or counteroffer, terminating negotiations, requesting 
that the other party wait, etc. The other party receives the 
message at block 138 and processes it at block 140, typically 
5 issuing a response message back to the agent. Control then 
returns to block 132 to process the response message from the 
other party • 

Second, as illustrated by block 142, the agent may 
decide to send a message back to the agent manager. The agent 

10 manager then receives the message at block 144, processes the 
message in block 146 and sends a response message back to the 
agent in block 148. In block 150, the agent receives the 
message and passes control to block 132 to process the response 
message from the agent manager. Also, as disclosed in U.S. 

15 Patent Application Serial No. 08/822,119, entitled "APPARATUS 
AND METHOD FOR COMMUNICATING BETWEEN AN INTELLIGENT AGENT AND 
CLIENT COMPUTER PROCESS USING DISGUISED MESSAGES" (which has 
been incorporated by reference) , the messages between the agent 
and its manager may be disguised to impede scrutiny by third 

20 parties. 

Third, as illustrated by block 152, the agent may 
decide to leave the remote system and return to its client 
system, e.g., after completion of a successful transaction or 
after termination of an unsuccessful negotiation. An agent may 

25 "leave" when its program code is moved or transmitted from the 
remote system to the client system. In the alternative, an 
agent may transmit any necessary information (e.g., domain 
knowledge and/or results) to the client system and terminate 
on the remote system. 

30 The return of the agent is illustrated in block 154. 

Then, in block 156, the performance of the agent may be 
analyzed in the manner discussed above. 

Fig. 9 illustrates an exemplary program flow for 
negotiation with a fully -dependent agent which acts principally 

35 ,as an intermediary with no authority to negotiate with another 
party without specific instructions from the 'agent manager. 
As with the semi -autonomous agent, negotiations are instituted 
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after the agent manager selects the agent from the agent pool 
and dispatches the agent, as illustrated in blocks 160 and 162, 
Next, in block 164, the agent gathers information after it has 
been dispatched to the remote system. Block 164 also performs 
the functions of processing messages from other parties. 

Next, in response to input received by the agent 
(typically as a result of an offer being received from another 
party), a message is sent to the agent manager in block 166. 
The agent manager then receives the message at block 168, 
processes the message in block 170 and sends a response message 
back to the agent in block 172. As with the semi -autonomous 
agent, messages between the agent and agent manager may be 
disguised to impede scrutiny by third parties. 

Next, in block 174, the agent receives the message 
and passes control to block 176 to process the instruction 
provided in the response message. Generally, one of two 
actions may be instituted in response to block 176, First, as 
illustrated by block 178, a message may be sent to the other 
party, e.g., indicating acceptance or rejection of an 
outstanding offer, issuing an offer or counteroffer, 
terminating negotiations, requesting that the other party wait, 
etc. The other party receives the message at block 180 and 
processes it at block 182, typically issuing a response message 
back to the agent. Control then returns to block 164 to 
process the response message from the other party. 

Second, as illustrated by block 184, the agent may 
decide to leave the remote system and return to its client 
system, e.g., after completion of a successful transaction or 
after termination of an unsuccessful negotiation. The return 
of the agent is illustrated in block 186. Then, in block 188, 
the performance of the agent may be analyzed in the manner 
discussed above. 

Fig. 10 illustrates an exemplary program flow for 
negotiation with a fully -autonomous agent which has full 
authority to conduct negotiations with little or no supervision 
by the agent manager. As with the semi -autonomous agent, 
negotiations are instituted after the agent manager selects the 
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agent from the agent pool and dispatches the agent, as 
illustrated in blocks 190 and 191. Next, in block 192, the 
agent gathers information after it has been dispatched to the 
remote system. Block 192 also performs the functions of 
5 processing messages from other parties. 

Next, in response to input received by the agent 
(typically as a result of an offer being received from another 
party) , control passes to block 19 3 to make a decision based 
upon the inputs to the agent. It is in this block that a 

10 negotiation strategy of the agent is implemented, whereby the 
agent determines, based upon the input it receives, whether and 
how to proceed in negotiations with another party. 

Generally, one of two actions may be instituted in 
response to block 193. First, as illustrated by block 194, a 

15 message may be sent to the other party, e.g. , indicating 
acceptance or rejection of an outstanding offer, issuing an 
offer or counteroffer, terminating negotiations, requesting 
that the other party wait, etc. The other party receives the 
message at block 195 and processes it at block 196, typically 

20 issuing a response message back to the agent. Control then 
returns to block 192 to process the response message from the 
other party. 

Second, as illustrated by block 197, the agent may 
decide to leave the remote system and return to its client 
25 system, e.g., after completion of a successful transaction or 
after termination of an unsuccessful negotiation. The return 
of the agent is illustrated in block 19 8. Then, in block 199, 
the performance of the agent may be analyzed in the manner 
discussed above. 

30 Any number of negotiation strategies and features may 

be incorporated into any of the agents in agent pool 42. For 
example, U.S. Patent Application Serial No. 08/821,935, 
entitled "INTELLIGENT AGENT WITH NEGOTIATION CAPABILITY AND 
METHOD OF NEGOTIATION THEREWITH", which has been incorporated 

35 by reference, discloses a suitable negotiation strategy which 
relies on the previous and current offers from the agent as 
well as the previous and current prices offered by the other 
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party. A number of additional negotiation features implemented 
in the aforementioned application may also be used, including 
dynamic value determination, disguising negotiation strategy 
by randomizing an agent's behavior, and limiting unproductive 
negotiations by constraining an agent's behavior, among others. 
Other negotiation strategies and features may also be used 
consistent with the invention. 

Moreover, any of the agents in agent pool 42 may be 
implemented using neural networks for implementing decision 
making and/or message disguising, as disclosed in U.S. Patent 
Application Serial No. 08/822,119, entitled "APPARATUS AND 
METHOD FOR COMMUNICATING BETWEEN AN INTELLIGENT AGENT AND 
CLIENT COMPUTER PROCESS USING DISGUISED MESSAGES" (which has 
been incorporated by reference). For example, for a semi- 
autonomous agent, a neural network may be configured to receive 
the following inputs: 

other party's last price 
other party's current price 
agent's last public offer 
product characteristics 
instruction from manager: 

increase in offer authorized 
increase in iterations authorized 
increase in offer not authorized 
increase in iterations not authorized 
last message sent back to manager 
In addition, the neural network may be configured to 
generate the following outputs : 

public price offer 
message to other party 
make offer . 
withdraw offer 
accept 
rej ect 
- : counteroffer 
please wait 
finished 
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message to manager 

probable acceptable alternative, do you 
accept 

iterations exceeded, recommend continue 
5 -- iterations exceeded, recommend withdrawal 

recommend immediate withdrawal 
dummy transmission to motivate negotiation 
request approval to increase offer 
For a fully -dependent agent, a neural network may be 
10 configured to receive the following inputs: 

other party's last price 
other party's current price 
number of iterations 
agent's last public offer 
15 - product characteristics 

instruction from manager: 
buy or sell 
withdraw 
counteroffer 

20 In addition, the neural network may be configured to 

generate the following outputs: 

public price offer 

message to other party - 
- - make offer 
25 -- withdraw offer 

accept 
reject 

counteroffer 
finished 

30 - message to manager 

current price offer 
other party's current price 
feature presence 
For a fully -autonomous agent, a neural network may 
35 be configured to receive the following inputs : 

other party's last price 
other party's current price 
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number of iterations 
agent's last public offer 
product characteristics 
In addition, the neural network may be configured to 
generate the following outputs: 

public price offer 
message to other party 

make offer 

withdraw offer 

accept 

reject 

counteroffer 
finished 

Moreover, for training purposes, additional 
information may also be used in the generation of suitable 
training records for any of the above agents, although such 
information is not directly input to the neural network. Such 
information may include: 

desired price and features 

agent autonomy price range (range of prices 
within which agent is authorized to negotiate 
autonomously) 

manager approval price range (range of prices 
within which agent requires approval from 
manager) 

acceptable iterations allowed without approval 
It should be appreciated that other agents 
implementing alternate negotiation strategies and decision 
making processes, and with other degrees of autonomy between 
fully -autonomous and fully -dependent , may be used in the 
alternative. Moreover, it should be appreciated that other 
functionality may be required for the autonomous operation of 
any of the agents in agent pool 42, and that such additional 
functionality may be implemented via procedural logic and/or 
neural networks. For example, functions such as 

initialization, maintenance, finding other agents or markets 
to interact with, etc, may also be utilized. However, as these 
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functions relate more to the basic operation of an agent, which 
is in general known in the art, these functions will not be 
discussed in any greater detail herein. 

Referring again to Fig. 1, as discussed above, agent 
manager 32 may select from multiple program modules of varied 
degrees of domain knowledge to perform a desired computer task. 
Agent manager 32 may utilize an agent from pool 42 having 
multiple program modules with varied degrees of domain 
knowledge, and then configure the agent to execute a selected 
program module based upon the objective criteria. 

For example, as shown in Fig. 11, agent manager 3 2 
may utilize an agent 300 that includes a main module 302 which 
oversees the operations of the agent, and a communications 
module 3 04 that handles communications between the agent and 
its manager, and between the agent and other parties. 
Moreover, one or more additional program modules 306-308 are^ 
interconnected to main module 302 to provide alternate 
functionality for the agent. Program module 306 implements a 
semi -autonomous functionality which operates in the manner 
disclosed above for the semi -autonomous agent of Fig. 8.. 
Program module 307 implements a fully -dependent functionality 
which operates in the manner disclosed above for the fully - 
dependent agent of Fig. 9. Program module 308 implements a. 
fully -autonomous functionality which operates in the manner 
disclosed above for the fully -autonomous agent of Fig. 10, 

Selection of which of modules 306 -308 may be made by 
agent manager, in the manner disclosed above with reference to 
Fig. 7. In the alternative, selection of program modules based 
upon an objective criteria may be made internally within the 
agent, e.g., using predetermined selection criteria (e.g., to 
employ different negotiation strategies depending upon the time 
of day, etc.), or using an optional reinforcement learning 
module 3 09 which operates in a similar manner to module 34 
discussed above. It should be appreciated that when selection 
is performed within an agent, selections may be updated from 
time to time to adapt the behavior of the agent in view of 
changing circumstances. 
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As also illustrated in Fig, 7, agent manager 32 may 
rely on one or more pools of program modules 42' to construct 
a task-specific agent depending upon an objective criteria. 
For example, as shown in Fig. 12, agent manager 32 may 
5 construct an agent 310 from a number of components. First, 
components which are generic to all permutations of an agent 
may be utilized, e.g., a main module 312 and communications 
module 314. 

Second, components may be selected from pluralities 

10 of alternate program modules. For example, in one embodiment 
where the decision logic of an agent is implemented in a neural 
network, a pool 320 of decision logic neural network modules 
may be provided, with another pool 322 of price range filters 
used to optimize the operation of the neural network modules 

15 for different price ranges. The use of the latter pool 322 
permits the modules in pool 320 to be price independent so 
that, when agent manager 3 2 is required to dispatch an agent 
to conduct transactions for a particular product in a 
particular price range, the agent manager merely selects an 

20 appropriate decision logic network module from pool 320 (e.g., 
semi -autonomous program module 316) and price range filter from 
^ pool 322 (e.g., price range C filter 318). Other manners of 
customizing the modules in pool 320 may be envisioned, e.g., 
selecting from different requirements filters to optimize an 

25 agent for different products. Also, both procedural and neural 
network implemented decision logic modules may be included in 
pool 320 so that, depending upon the risk to a dispatched 
agent, either type of logic may be utilized. 

Third, instead of selecting among alternate program 

30 modules, agent manager 32 may add additive program modules to 
enhance the domain knowledge of the dispatched agent. For 
example, in one embodiment, a pool of optional modules 3 24 may 
be provided with, among other modules, a time monitor module 
319 that permits the negotiation strategy of the agent to vary 

35 depending upon the time of day (e.g., for markets which tend 
to be more volatile at the end of the day, module 319 may 
implement a more conservative negotiation strategy than 
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Otherwise employed) . Other optional functionality may be 
implemented. For example, agent performance monitoring may 
optionally be included within agent 310 if desired. 

Any number of programming languages and underlying 
5 platforms may be utilized to implement agents consistent with 
the invention. For example, Java and other object oriented 
programming languages are particularly well suited for use with 
agents implementing multiple program modules having varied 
degrees of domain knowledge, as separate classes implementing 

10 each of the alternate program modules may be interconnected 
with one another in a single package. 

Other modifications may be made to the illustrated 
embodiments without departing from the spirit and scope of the 
invention. Therefore, the invention lies solely in the claims 

IB hereinafter appended. 
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Claims 

1 1. A method of handling a computer task using an 

2 intelligent agent, the method comprising the steps of: 

3 (a) based upon an objective criteria, selecting a 

4 selected intelligent agent from a plurality of intelligent 

5 agents having varied degrees of domain knowledge, wherein 

6 the plurality of intelligent agents are configured to 

7 handle a common computer task; and 

8 (b) dispatching the selected intelligent agent to 

9 handle the computer task. 

1 2. The method of claim 1, wherein the selecting step 

2 includes the step of adaptively selecting the selected 

3 intelligent agent using a reinforcement learning algorithm. 

1 3. The method of claim 2, further comprising the steps 

2 of: 

3 (a) obtaining performance information relating to 

4 the performance of the selected intelligent agent in 

5 handling the computer task; and 

6 (b) supplying the performance information to the 

7 reinforcement learning algorithm. 

1 4. The method of claim 2, wherein the reinforcement 

2 learning algorithm is implemented in an adaptive heuristic 

3 critic neural network. 

1 5. The method of claim 1, wherein the selecting step 

2 includes the steps of: 

3 (a) matching each of the plurality of intelligent 

4 agents with a value of the objective criteria; 

5 (b) determining a selected value of the objective 

6 criteria; and 

7 (c) selecting as the selected intelligent agent an 

8 intelligent agent matching the selected value of the 

9 objective criteria. 
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1 6, The method of claim 5, wherein the selecting step 

2 further includes the step of retrieving information for a 

3 selected computer task, wherein the determining step determines 

4 the selected value of the objective criteria using the 

5 retrieved information. 

1 7. The method of claim 1, wherein each of the plurality 

2 of intelligent agents is configured to conduct negotiations in 

3 an electronic commerce application, and wherein the domain 

4 knowledge for each of the plurality of intelligent agents is 

5 related to the autonomy delegated thereto. 

1 8. The method of claim 7, wherein the plurality of 

2 intelligent agents includes a semi -autonomous agent, a fully - 

3 autonomous agent, and a fully -dependent agent, 

1 9, The method of claim 7, wherein the objective criteria 

2 includes a risk that a dispatched agent is subjected to in 

3 negotiations. 

1 10. An apparatus for handling a computer task; 

2 comprising: 

3 a plurality of intelligent agents having varied 

4 degrees of domain knowledge, wherein the plurality of 

5 intelligent agents are configured to handle a common 

6 computer task, and wherein, based upon an objective 

7 criteria, a selected agent from the plurality of 

8 intelligent agents is dispatched to handle the computer 

9 task, 

1 11. The apparatus of claim 10, further comprising an 

2 agent manager configured to dispatch the selected intelligent 

3 agent based upon the objective criteria. 
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1 12. The apparatus of claim 11, further comprising a 

2 reinforcement learning module, coupled to the agent manager and 

3 configured to adaptively select intelligent agents based upon 

4 the performance of the plurality of intelligent agents in 

5 handling the computer task. 

1 13. The apparatus of claim 12, wherein the reinforcement 

2 learning module comprises an adaptive heuristic critic neural 

3 network. 

1 14. The apparatus of claim 11, wherein the agent manager 

2 is configured to retrieve information for a selected computer 

3 task, determine a selected value for the objective criteria for 

4 the selected computer task, and select as the selected 

5 intelligent agent one of the plurality of intelligent agents 

6 which is matched with the selected value of the objective 

7 criteria, 

1 15. The apparatus of claim 10, wherein each of the 

2 plurality of intelligent agents is configured to conduct 

3 negotiations in an electronic commerce application, and wherein 

4 the domain knowledge for each of the plurality of intelligent 

5 agents is related to the autonomy delegated thereto. 

1 16. The apparatus of claim 15, wherein the plurality of 

2 intelligent agents includes a semi - autonomous agent, a fully- 

3 autonomous agent, and a fully- dependent agent. 

1 17. The apparatus of claim 15, wherein the objective 

2 criteria includes a risk that a dispatched agent is subjected 

3 to in negotiations. 

1 18. A program product comprising: 

2 (a) a program configured to handle a computer task 

3 using an intelligent agent, the program comprising a 

4 plurality of intelligent agents having varied degrees of 

5 domain knowledge, wherein the plurality of intelligent 
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6 agents are configured to handle a common computer task, 

7 and wherein, based upon an objective criteria, a selected 

8 agent from the plurality of intelligent agents is 

9 dispatched to handle the computer task; and 

10 (b) a signal bearing media bearing the program, 

1 19. The program product of claim 18, wherein the signal 

2 bearing media is transmission type media. 

1 20. The program product of claim 18, wherein the signal 

2 bearing media is recordable media. 

1 21, A method of handling a computer task on a remote 

2 computer system using an intelligent agent, the method 

3 comprising the steps of: 

4 (a) determining a risk for the remote computer 

5 system; 

6 (b) based upon the risk for the remote computer 

7 system, selecting a selected intelligent agent from a 

8 plurality of intelligent agents having varied degrees of 

9 domain knowledge, wherein the plurality of intelligent 

10 agents are configured to handle a common computer task on 

11 the remote computer system; and ;^ 

12 (c) dispatching the selected intelligent agent to 

13 handle the computer task, 

1 22. The method of claim 21, further comprising the step 

2 of matching each of the plurality of intelligent agents with 

3 a risk level. 

1 23. The method of claim 22, wherein the matching step 

2 includes the step of adaptively matching each intelligent agent 

3 based upon the actual performance of the plurality of 

4 intelligent agents. 
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